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Abstract — We consider the multiuser successive refinement (MSR) 
problem, where the users are connected to a central server via links 
with different noiseless capacities, and each user wishes to reconstruct in 
a successive-refinement fashion. An achievable region is given for the two- 
user two-layer case and it provides the complete rate-distortion region 
for the Gaussian source under the MSE distortion measure. The key 
observation is that this problem includes the multiple description (MD) 
problem (with two descriptions) as a subsystem, and the techniques 
useful in the MD problem can be extended to this case. It is shown 
that the coding scheme based on the universality of random binning 
is sub-optimal, because multiple Gaussian side informations only at the 
decoders do incur performance loss, in contrast to the case of single side 
information at the decoder. It is further shown that unlike the single 
user case, when there are multiple users, the loss of performance by a 
multistage coding approach can be unbounded for the Gaussian source. 
The result suggests that in such a setting, the benefit of using successive 
refinement is not likely to justify the accompanying performance loss. The 
MSR problem is also related to the source coding problem where each 
decoder has its individual side information, while the encoder has the 
complete set of the side informations. The MSR problem further includes 
several variations of the MD problem, for which the specialization of the 
general result is investigated and the implication is discussed. 

Index Terms — Multiple description coding, rate distortion, source 
coding, successive refinement. 



I. Introduction 

Multiuser information theory has attracted much attention recently 
because of the growth in the complexity and capability of the practical 
communication networks. In this work, we consider the multiuser 
successive refinement (MSR) problem formulated by Pradhan and 
Ramchandran in [1]. In this problem, a server is to provide multi- 
media data to users connected to the server through channels with 
different (noiseless) capacities, e.g., a dial-up connection vs. a high- 
speed cable connection. The server performs the transmission in a 
broadcasting manner in order to reduce operating cost, and thus 
the users with bad channels will only receive a (known) subset of 
the bitstream, while the users with good channels will be able to 
receive the complete bitstream. Furthermore, to reduce the delay 
for each user, the server would also like to provide the bitstream 
in a successive refinement fashion user-wise. The "multiusers" in 
the MSR problem thus receive degraded message sets, while the 
"successive refinement" refers to the fact that there are multiple 
rounds (layers) of such transmissions. 

A diagram is given in Fig. Q] for a system with two users and 
two layers. We will assume the user with good channel connection 
will remain so for the complete transmission, however the exact rates 
Tin (the first subscript specifies which user and the second subscript 
specifies which round of transmission), R12, R21, R22 can vary. If the 
transmission rate is fixed during the transmission, then = ^p-, 
which is a special case of this general setting; this special case is 
important in practice as we expect that the channels between the 
transmitter and the receivers remain the same over the two rounds of 
transmission. We will only consider the two-user two-layer system 
in this work. 

This work was presented in part at 2006 IEEE Information Theory 
Workshop in Oct. 2006, Chengdu, China. 
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Fig. 1. A system diagram with two users and two layers. The "good" user 
(user 2) receives the complete message while the "bad" user (user 1) only 
receives a subset of the message. There are two rounds of transmissions, 
illustrated by the dotted curve and dash-dot curve, and in each round the 
users form their reconstructions accordingly. 



The notion of successive refinement of information in the single 
user setting was introduced by Koshelev [2] and by Equitz and 
Cover [3] (see also [4]), and the problem is well researched. The 
main question is whether the requirement of encoding a source 
progressively necessitates a higher rate than encoding without the 
progressive requirement. A source is successively refinable if en- 
coding in multiple stages incurs no rate loss as compared with 
optimal rate-distortion encoding at the separate distortion levels; i.e., 
when the necessary encoding rate does not increase comparing to a 
single stage coding. The reassuring result by Equitz and Cover is 
that many familiar sources, such as the Gaussian source under the 
mean squared error (MSE) distortion measure and discrete sources 
under Hamming distortion measure, are in fact successively refinable. 
Lastras and Berger [5] further showed that when the source has a real 
alphabet and the distortion measure is MSE, even when the source 
is not successively refinable, the rate loss is bounded by a universal 
constant. 

In the multiuser setting we are interested in understanding whether 
the progressive coding requirement necessities any performance loss, 
and if so, whether the loss is bounded. In this work, we provide 
an achievable rate-distortion region for the problem with two users 
and two layers by embedding a (two-description) multiple description 
problerrQ inside it, and show that this region is tight for the Gaussian 
case. Furthermore, the loss of performance to a single layer coding 
can indeed be unbounded, which suggests unless there is a significant 
reason calling for a progressive coding, the loss of performance makes 
it a less attractive system design. 

The MSR problem includes the problem of multiple descriptions 
(MD) as a subsystem, and the techniques in the MD literature 
(notably [6] and [7]), are our main tools in this work. We show 
that the coding scheme given in [1] based on random binning is 
sub-optimal, because multiple Gaussian side informations only at the 
decoders incur performance loss compared to the side information 
also available at the encoder; this is in contrast to the case of single 
side information at the decoder, where there is no essential loss 

'The MSR problem is in fact a special case of the general problem of 
multiple descriptions with more than two descriptions; more precisely, there 
are four descriptions in the systems, but only four distortion requirements 
are considered, instead of one distortion constraint for each non-empty subset 
of the four descriptions. From here on, we shall use MD to stand for the 
conventional two description problem instead of the more general setting, 
unless specified otherwise explicitly. 
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[8]. The MSR problem is also related to the problem considered 
in [9], where each decoder has its individual side information, and 
the encoder has the complete set of side informations. The MSR 
problem further includes several variations of the MD problem, such 
as the MD problem with central refinement (MDCR) [10], as well as 
the conditional MD problem. We will discuss the MDCR problem in 
detail and reveal the implication of the general result from MSR. 

The distortion-rate (D-R) region and the rate-distortion (R-D) 
region given for the Gaussian MSR problem can be easily reduced 
to those for the MD problem. Though the Gaussian MD region has 
been known for more than 25 years, as pointed out in [11] (discussed 
more extensively in [12]), the expressions given in the literature are 
usually not complete (even incorrect if being used without caution) 
and we hope this confusion can be clarified by the present worlfl 

It is worth pointing out that the formulation considered in this 
work is from the source coding point of view, which implies a coding 
system where source and channel coding are separated. However, it is 
well known that under the degraded broadcast channel, for which the 
considered coding approach is arguably the most suitable, a source- 
channel separation approach is not optimal (see, for example [13]). It 
is nevertheless useful to consider the current formulation, since firstly 
joint source-channel coding (JSCC) schemes are often more complex, 
and secondly the performance using source-channel separation can 
be used to compare with that of any JSCC schemes to measure the 
possible performance loss. Moreover, in [14] and [15], it was shown a 
source channel separation indeed holds in the scenario of successive 
refinement coding with side information at the decoder. 

The rest of the paper is organized as follows. In Section [XT] the 
problem is formally defined and some related background is given. 
An achievable region is given in Section [Til] In Section IWl we prove 
that the given achievable region is tight for the Gaussian source, then 
analyze the performance loss comparing with single layer coding 
and discuss a special case with fixed channel configuration. Section 
|Vl discusses the MDCR problem as a special case of the problem 
being treated and Section IVll concludes the paper. 

II. Problem Definition 

Let A" be a finite set and let X n be the set of all n-vectors 
with components in X. Denote an arbitrary member of X n as 
x n — (xi,X2, ...,x n ), or alternatively as x; (xi, X2,---,Xj) will also 
be written as Xi.....j. Upper case is used for random variables and 
vectors. A discrete memoryless source (DMS) (X, Px) is an infinite 
sequence {Xi\°^L 1 of independent copies of a random variable X in 
X with a generic distribution Px with Px(x n ) = n™=i Px(xi). 
Let A 1 be a finite reconstruction alphabet, and for simplicity we 
assume that the decoders all use this reconstruction alphabet. Let 
d : X x X —> [0, oo) be a distortion measure. The single-letter 
distortion of a vector is defined as 
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Instead of directly considering the system depicted in Fig. [TJ we 
consider the equivalent system given in Fig. [2] The reformulation is 
crucial, which makes the rather involved relations between descrip- 
tions more explicit. The double subscript in Fig. [TJ is simplified to 
single subscript, whose correspondence is made clear in Table Q] 

Definition 1: An (n, Mi. .a, D1...4) MSR code for source X 
consists of 4 encoding functions (j>i and 4 decoding functions ipi, 

2 Feng and Effros clarified the Gaussian MD region in terms of R-D 
characterization in [11], but the interpretation of the degenerate region was 
not made explicit. 
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Fig. 2. The equivalent system diagram to the two-user two-layer system 
depicted in Fig. [TJ 

TABLE I 

The correspondence between system of FicQJand that in Fig. [2] 



double subscript 


(11) 


(21) 


(12) 


(22) 


single subscript 


1 


2 


3 


4 



i = 1,2,3,4: 



X n -> I M ., i = 1,2,3,4 



where Ik — {1, 2, fc} and 

^1 : I Ml — > X n , i> 2 : I Ml x 7m 2 -> X n , 

ip 3 : I Ml x I Ms — > X n , I Ml x I m 3 x Im 3 x I Mi -> X n , 

such that 

Ed(A"\ViW>ipn)) < D u 

Ed(X n ,MMX"),MX n ))) < D 2 , 

Ed(X n ,MMX n ),MX n ))) < Da, 

Ed(x n ,MMx n ),Mx n ),Mx n ),Mx n ))) < d 4 , 

where E is the expectation operation. 

For the rest of the paper, we will often refer to the output of 
the encoding function 4>i as description i. All the logarithms and 
exponentials are base e. 

Definition 2: A rate distortion eight-tuple (R1...4, -D1...4) is said 
to be achievable, if for any S > and sufficiently large n, there exist 
an (n, Mi. .. 4 , Di +S,D 2 + S, D 3 +5,D 4 + S) MSR code, such that 
^ + S > i log(Mi) for i = 1,2, 3, 4. 

The MSR rate-distortion region, denoted by Q, is the set of all 
achievable eight- tuples. The problem of characterizing this region 
is difficult in general, because the problem at hand can be reduced 
to the well-known multiple description (MD) problem, which is a 
long standing open problem. In the MD problem, one sends two 
descriptions over two unreliable channels, either of which can break 
down; the question is to characterize the achievable rate-distortion 
quintuple consisting of the two description rates and the distortions 
of individual description, as well as that resulting from the two 
descriptions jointly. To reduce the MSR problem to the MD problem, 
we only need to set Ri — R4 = and Di — 00. 

The literature on the MD problem is vast (see [16] for a review) 
and new results are emerging, but the problem remains open. Inner 
bound exists [7] and it was shown that this bound is in fact quite good 
for source with real alphabet under MSE distortion measure [17], 
but it is not tight in general [18]. The Gaussian source with MSE 
distortion measure (and the recent extension to the vector Gaussian 
problem [19]), for which this inner bound is tight, is the only case 
that the R-D region is completely characterized. Given these facts, 
our focus will not be on finding a complete solution for the general 
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MSR problem. Instead, we will extend the coding scheme in [7] to 
give an achievable region, and then focus on the quadratic Gaussian 
case, for which the achievable region is indeed tight. 

We now briefly outline the coding scheme given by El Gamal 
and Cover in [7] for the MD problem: given joint distribution 
Pxx 1 x 2 x 3 , generate two length-n codebooks using the marginals 
Px 1 and Px 2 , respectively. It is well known that if approximately 
exTp(nI (X; Xi)) and exp(n/(X; X 2 )) codewords are generated for 
the two codebooks, respectively, then with high probability we can 
find codewords X™, respectively X 2 , jointly typical with any X n 
vector in the individual codebook. However, to guarantee the chosen 
codewords X™, X% are also jointly typical together with X n , the 
codebook sizes have to increase. The resulting increased rate is the 
expense paid to "match" X" and XJ . Then for the matched X™ and 
X£ vector, a conditional codebook using Px 3 \x 1 x 2 can be further 
added. The decoders then use the codewords X™, X"% and XJ as 
the reconstructions. 

III. An Achievable Region 

Several schemes were outlined in [1] for the MSR problem which 
can achieve several specific operating points. One of them is to treat 
X\ and X 2 , which are the reconstructions in the first layer for user 
1 and user 2, respectively, as side informations at the decoder and 
use the Wyner-Ziv binning approach [8] to generate description 3, 
and also use binning for description 4. The intuition behind this 
scheme is that the binning strategy has certain "universal" property 
that whenever the bin is sufficiently small to decode with some 
side information, it can also be decoded with better quality side 
information (see [20]). Since X2 is a better quality side information 
than Xi, user 2 can also decode the description 3, which is meant 
for user 1; furthermore, user 2 can use it to improve its estimation. 

Though the above observation is important, and perhaps provides 
the insight for the important result on symmetric N description 
problem [21], it is not optimal for the current problem. Notice that 
since the receiver 2 has access to both description 1 and description 
2, it can also reconstruct Xi, in addition to X2 (which is its desired 
reconstruction). As such, a conditional codebook on Xi is more 
suitable, since it is available at both the encoder and the decoder. 

It should now be clear that the MD coding method can be used 
in MSR, if we treat Xi as the common side information available at 
both the encoder and the decoders when encoding for X2, X3 and 
X 4 . To insure there are sufficient codewords in the Xi codebook 
such that source vectors are covered with high probability, a rate 
of Ri = I(X;Xi) + Si can be chosen (Si,i = 1,2,3 are small 
positive quantities). Conditioned on Xi, two codebooks of size 
exp(ni? 2 ) and exp(ni?3), respectively, are generated using P Xo \x 1 
and Px 3 \x 1 > as discussed in the last section, in order to find Xj and 
X3 1 (conditioned on Xi) jointly typical with X n (i.e., matched) in 
these two codebooks with high probability, the codebook sizes should 
be chosen accordingly. More precisely, we can choose 

ft + ft = I(X-,X 2 X 3 \X 1 )+I(X 2 -,X 3 \X 1 )+S2, (2) 
R 2 > /(X;X 2 |Xi), R 3 > /(X;X 3 |Xi). (3) 

Given the codeword X™ and the matched codewords X 2 l and X 3 , in 
the last coding stage a codeword in the codebook of size exp(?ii?4) 
generated by Px 4 \x 1 x-,x 3 ' s chosen which is jointly typical with the 
source vector X n and the previously chosen codewords X" , XJ and 
X 3 ; for such a purpose we can choose R4 = I(X; X4IX1X2X3) + 
^3. Now an achievable region is readily available using standard 
techniques, though it is not clear if it is optimal. 

Here we would like to bring attention to a quite subtle and often- 
overlooked fact: even when Ri — 0, the reduced problem is still 



not the same as the MD problem. Notice there are three rates 
(R2, R 3 , R4) to characterize here, instead of the two rates in the 
MD problem; it is nevertheless a special case of the general three 
description problem. This problem, which we refer to as the multiple 
descriptions with central refinement (MDCR) problem [10], will be 
treated in more depth later. Though not the same, the MDCR system 
is not unfamiliar: the coding scheme in [7] in fact uses such a 
structure. 

Given the discussion above, we next state an achievable region 
without detailed proof for the sake of brevit}0. Define the region 
Qach to be the set of all rate distortion eight-tuples (R1...4, -D1...4) 
for which there exist four random variables Xi, X 2 , X 3 , X4 in finite 
alphabet X such that 



Ed{X,Xi) <Di,i= 1,2,3,4, 

and the non-negative rate vector satisfies: 

Ri > /(X;ii) 
Y ft > /(X;XiX 2 ), 



(4) 



(5) 
(6) 



Y Ri > I(X;XiX 3 ), (7) 
Y ft > I{X;XxX 2 X 3 )+I{X 2 -,X 3 \X{) r (8) 

i=l,2,3 

Y ft > /(X;A > 1 X 2 X 3 X4)+/(X 2 ;l3|X 1 ). (9) 

1=1,2,3,4 

The following theorem provides an achievable region. 
Theorem 1: 

Qach C Q. 

If Ri — R4 = 0, Qach degenerates to the achievable region given 
by El Gamal and Cover in [7]. The region is characterized by a set of 
sum-rate bounds, instead of the individual rate for (Ri , R2, R 3 , -R4). 
It is not immediately clear that the aforementioned coding scheme 
(with individual rates) can achieve the complete region characterized 
by the sum-rates in Theorem 1 . However, a moment of thought reveals 
that the structure of this system implies that for any code with rates 
(n, r 2 , 7-3, 7-4), we can freely move the rates r 2 (and r 3 ) into n, and 
rate r± into ri,r 2 ,r3 to construct new codes; see [23] (also [4]) for 
a thorough explanation regarding a similar property in the successive 
refinement problem. Thus indeed the region given in Theorem 1 is 
achievable. 

It is not clear whether the region Qach is convex. Interestingly, 
a generalization of this region, denoted as Q' ach , is indeed convex, 
and we now provide this generalized region. Let the region Q' ac h be 
the set of all rate distortion eight-tuples (-R1...4, -D1...4) for which 
there exist four random variables Ui,U2,U 3 ,U4 in finite alphabets 
Ui,U'iMiM4 sucn that there exists deterministic functions 



gi : U\ — > X, 
.93 : Ui x Ui — > X, 



g 2 : Ui x U2 -> X (10) 
.94 : Ui x U 2 x Ui x U4 -> X (1 1) 



satisfying 



m(X,gx{U x ))<D x , Ed(X,g 2 (U 1 ,U 2 ))<D 2 , (12) 
Ud(X,ga(Ui,U 3 )) < Da, tod(X,gi(Ui, U 2 , U 3 , U4)) < D 4 (13) 

3 See also [22] for a similar result for multiple descriptions when both 
encoder and decoders have access to common side information. 
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and the non-negative rate vector satisfies: 

Ri > I(X;Ui), 

ft > I(X;U!U 2 ), 

t=l,2 

]T ft > I(X;UiU a ), 

»=1,3 

^ % > J(X;MEM + /(l/a;l7a|l7i), 

i=l,2,3 

53 ft > I(X;UiU a U3U4)+I(Ua;U 3 \Ui) 

i=l,2,3,4 



(14) 
(15) 

(16) 

(17) 

(18) 



Theorem 1': 



Q'ach C Q. 



The proof of this theorem follows the exact same line as that of 
Theorem[T] Moreover, it can be shown straightforwardly that Q ac h Q 
Q'ach' but it is not clear whether the inclusion in the other direction 
is also true. Though the region Q' ac h ls more general, it is also more 
complex due to the involvement of several decoding functions. In 
fact, as suggested in [18], the achievable region given by El Gamal 
and Cover [7] originally had a form with several decoding functions, 
which was later largely abandoned in favor of the region defined 
without such functionO The region Q' ac h ls more suitable when we 
consider the Gaussian source, and we thus include both forms here 
for completeness. 

It is not difficult to show that Q' ac h ls convex. Let 
P(U?,Ug,U§,U2\X) and P(Ul,U%,U£,Ul\X) be two condi- 
tional distributions which provide rate vectors in Q' ac h- L et Q be 
a Bernoulli random variable with Pr(Q = 0) = A and Pr(Q = 1) = 
1 — A, which is independent of everything, then it is easily seen 

\I{X;U°) + (1 - \)I{X;Ul) 
= \I(X;U?\Q = 0) + (1-\)I(X;U}\Q = 1) (19) 



I(X;U?,Q). 



(20) 



Moreover 

A/(C/ 2 ;C/3|C/i°) + (1 - \)KUl;Ul\Ul) = I(U?;U?\U?,Q). 

(21) 

Similar relations can be derived for the other mutual information 
quantities. Now define Ui = (U®,Q), and the decoding functions 
can be defined accordingly; it follows that this convex combination 
of the rate vectors is indeed in Q' ac h- 

Though we have considered discrete sources so far, the results can 
be generalized to Gaussian sources using the techniques in [24] [25]. 
In the next section, we prove a converse for the Gaussian source 
under the MSE distortion measure, and show that the region given in 
Theorem[T]is tight for the Gaussian source. It is worth clarifying that 
the converse result for the Gaussian MSR problem is not implied by 
that of the Gaussian source with common side information at both the 
encoder and the decoders [22], because the optimal first codebook in 
MSR is not necessarily a codebook generated with any single letter 
marginal distribution P$ , while the common side information is 
always an i.i.d. random variable in the setting of [22]; moreover, the 
MSR problem is further complicated by the included MDCR sub- 
system. 

IV. The Gaussian Source 

In this section, we shall focus on the Gaussian source with MSE 
distortion measure, and establish the distortion-rate as well as the 
rate-distortion region. 

^Different from the MSR problem, in the MD problem, the region defined 
without the decoding function is in fact more general; see [18]. 



A. The distortion-rate region for the Gaussian source 

Theorem 2 below gives the distortion-rate (D-R) region for the 
Gaussian source, and Theorem 2' gives the rate-distortion (R-D) 
region. Similar to the MD problem, the D-R region is simpler than 
the R-D region due to less number of degenerate regions (see [11] 
and [26]). It will be illustrated that the R-D region can be established 
from the D-R region. We shall first present the theorems, and then 
follow the approach by Ozarow [6] to establish the D-R region. 

Theorem 2: For the Gaussian source X ~ Af(0,al) under MSE 
distortion measure, the achievable distortion-rate region for rates 
(Ri , Ri , R3 , Ra) is given by 

di > cr^exp[-2J?i], (22) 

d 2 > aiexp[-2(Ri+R 3 )], (23) 

d 3 > cr^ exp[-2(i?i + R 3 )], (24) 
al exp[-2(i?i +R2 + R3+ Ri)} 



d 4 > 



1- (|V5- \/A|+) 5 



(25) 



where \x\ + = max(i,0) and 



d* = cr^ exp[— 2Ri], d~2 — min(d2, dl), d 3 = min(d3, d*) 



a 2 

exj. 

n =(i4)(i4 
di n d{ 



^ A d 2 d 3 



df 



exp[-2(R 2 + R 3 )]. 



(26) 

Define the function R(D) = i log jj. The following theorem 
describes the R-D region for the Gaussian source. 

Theorem 2': For the Gaussian source A' ~ J\f(0, oi) under 
MSE distortion measure, the achievable rate-distortion region for 
distortions (di, d 2 , d$, d^) is given as follows. 



R 1 >Rl±R C^ dl > al) )- 



(27) 



For any rates Ri > RI and R4 > 0, define d^ = d^ exp(2J?4). The 
achievable rates (R2, Rs) are given as: 



ft > R(d 2 /dl), R 3 > R(d 3 /dl), 



(28) 



ft + ft > 



where 



R(di/d'j) < d 4 < d 2 + d z - d\; 

d A > {if 1 + d 3 ~ 1 - (d*) -1 ) -1 ; 

R(di/d~i) + L otherwise, 

(29) 



^=ilog : . . ■ (30) 

2 (d* - d 4 ) 2 + (dlVll + V d 2 - dW d 3 - d 4 ) 2 

There is one degenerate case in the D-R region (when II < A), and 
there are two degenerate cases in the R-D region (the first two cases 
in d29|l). They are degenerate in the sense that any eight-tuple in those 
regions is worse than or equal to (in each component) an eight-tuple 
on the boundary of the non-degenerate region. This interpretation is 
made more explicit at the end of the forward proof for Theorem 2. 

The region given in Theorem [2] reduces to the Gaussian MD 
region, when Ri = R4 = 0. The form of this achievable region 
is not surprising, given the aforementioned achievable scheme and 
the Guassian MD region in [6] (with the additional degenerate case 
made explicit here). However, the converse is not yet clear due to 
the involvement of the coding functions <j>\ and (j>i. More precisely, 
the following two questions regarding <j>\ and <f>i, respectively, can 
be asked: 

1) Is a Gaussian codebook optimal for encoder </>i? 

2) With the additional information provided by <f>i, should the 
codebooks generated for <f>2 and (f> 3 still have the same structure 
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as the MD codebooks? In other words, should there still be 
matched codewords for (almost) every typical source sequence 
in the codebooks for 02 and 03 (or the matching can be moved 
into 04)? 

In the proof, we will show that the answers to both the questions are 
positive. The main difference from the well-known proof by Ozarow 
[6] for the Gaussian MD problem is the additional coding stages 0i 
and 04, which makes the converse more involved, and the entropy 
power inequality has to be applied twice in the proof. 

Before proceeding to the proof of Theorem [2] we illustrate the 
equivalence of the D-R and the R-D characterizations. The inequali- 
ties in d22li-i24l> are clearly equivalent to {27} and the two inequalities 
in {28}. Thus we only need to establish that with them, the region of 
triples (R2,R3,d,4) characterized by j25\ and that characterized by 
([29} are equivalent for any valid and fixed (Ri, Ri, di, d,2, (I3). This 
is illustrated geometrically in Fig. [3] It is seen that the same region 
above the surface can be described either in two regimes as in the D-R 
characterization, or in three regimes as in the R-D characterization. 
The functions given in Theorem 2 and Theorem 2' specifying the 
regions can be shown to be equivalent with some amount of algebra 
(see [11] and [12] for a brief discussion on this algebraic computation 
in the MD problem). Note that the same argument is true for the MD 
problem, simply by taking Ri = R4, — 0. 

Proof: [Theorem |2) 

Converse: The following bounds are straightforward by conventional 
rate-distortion theory: 

di > o 2 x exp(— 2Ri), 

d 2 > alexp[-2(Ri+R 2 )], 

d 3 > a^exp[-2(i?i +R a )]. (31) 



We also have 

I(X n ;X2) < /pf n ;0i020304) < #(0i 020304) 
= H(0i)+#(0 2 |0i) + #(03|0i) 

-I(<j>2-, 3 |0l) + /f(0 4 |01020 3 ) 

< n(R 1 +R 2 +R 3 + Ri)-I((p2;<p3\<i>i), 

where (a) is because X4 is determined by the encoding functions 
(<f>i, 4>2, 03, 4 ), and (b) is because conditioning reduces entropy as 
H(fa\<h.) < H(<p 3 ), H(<j> 2 \4>i) < H{4> 2 ) and H (^ihh) < 
H((f>4), and because of the cardinalities of the encoding functions. 
By converse to the source coding theorem 

di > D(-I(X n -Xl 1 )) 
n 

> a 2 x exp[-2(Ri + R 2 + R 3 + R<L)]exp[-I(ct>2;<t>3\c/>i)]. 

n 

(32) 

Because X 2 and X3 are functions of 4>i , (f>2 and 0i , 03, respectively, 
it is seen that 

/(02;03|0i)>/(X 2 n ;X 3 n |0i). (33) 

As in the proof by Ozarow [6] for the MD problem, let Y — X + 
N, where iV is zero mean Gaussian with variance e and independent 
of X. Because of the following identity 

J(X 2 ";j£?Y n |0i) 

= j(x 2 ";x 3 "|y"0i) +/(!?; H0O 

= I{XS;X2\<h.) + I(XZX2; F n |0i) - /(X 3 n ; Y"^), 
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we have 

I(X2;XS\4>i) 

> /(l 2 n ; Y n \fa) + I(XS; Y n \fa) - y»|fc) 
= h(Y n \^ 1 )~h(Y n \XZcb 1 )+h(Y n \ ( b 1 ) 

-h(Y n \XUi) - HY n \^) + h(Y n \X% XS4>i) 
= h{Y n \^) - hiY^XZfa) - h(Y n \XScj>i) 

+h(Y n \XZX%<j )1 ) 
= I(Y n ;X2<t >1 ) + I(Y n ;X2<j> 1 )-2h(Y n ) 

+h(Y n \^) + h(Y"\XSXSh) 

> I(Y n ;X?)+I(Y n ;X$)-2h(Y n ) 

+h(Y n \cf>i) + h(Y n \xSx24>i)- (34) 

Taking Y as a Gaussian source, the distortion between Y n and 
X2 is upper bounded by n{di +e), by the converse to source coding 
theorem 

I(Y n ; X 2 n ) > nR Y (d 2 + c ) = £ log ^±1, (35) 

2 d 2 + e 

where Ry(D) is the rate distortion function for source Y . Similarly 



I(Y n ;X%) > -log^±i. 
v ' A> ~ 2 6 d 3 + e 



(36) 



The following steps in our converse proof are different from 
Ozarow's, and it is worth noting the complication introduced by 
the coding function <j>i. We apply the conditional entropy power 
inequality [27] on the term h(Y n \(j>i), which gives 

KY n \4>i) > £ log[eM-h(X n \4>i)) + 27ree]. (37) 
However notice that 

/i(X' l |^) = - I(X n ; d>!) > | log(2^) - nRi, 

which gives 

h{Y n \4>i) > ^log[exp(-[~log(27re^)-nft]) + 27ree] 
Z n Z 



■log[27re(<r*exp(-2ft) +e)]. 



(38) 



Applying the entropy power inequality again on the term 
h(Y n \X5XS(f>i), we have 

fe(y n |X 2 "A70i) > ^log[exp(-h(X n \X^X?<p 1 ))+2wee]. 
2 n 

It follows that 

h(x n \xsxs^) 

= h(X"|^i)-I(jr";^i?|^i) 

( => h{X n \<t>{)- H(&$X?\4n) 

= fr(X n ) - I(X n ; (pi) - H(X2\tpi) 

-ff(*?|0i) + J(A?;X?|0i) 

> h(X n ) — I(X n ; <j>i) — H(4>2\4>i) 

-HM^ + IiX^-XS^i) 

> | \og{2itea 2 x ) - n(ft + ft + ft) + I(i$;M\<h), 

where (a) follows from the fact that X£ and are functions of 
X", and (b) from the fact X% and X£ are functions of (<f>i, <f>2 ) and 



3), respectively. This leads to 

1) 



h(Y n \X2X% 



27ree 



+ exp (~[^ log(2necr 2 x ) ~ ft + H*?; *s |&)] j 



= |log<j 27re ( exp[-2 ^ ft] exp[^i(X 2 n ; Xj^i)] + e 



Define 



^exp[H/(X 2 ";X 3 n |0i) 



(39) 



(40) 



and summarize all the bounds in {34), OS), {36]>, OH) and «[39jl, and 
we thus have 



t > 



d* + e 



(d 2 + e)(d 3 + e 
Isolating t we have 

t > 



\td{ exp[-2(ft + ft)] + e] 



(41) 



(42) 



(da + e)(d 3 + e) - (d? + e)d* exp[-2(ft + ft)] ' 

notice that because d 2 > d* exp(— 2ft) and d$ > d* exp(— 2ft) 
from d31| l, the denominator is always positive, as long as e is positive. 

To get the tightest bound, we maximize the lower bound on t over 
€. When d 2 < d\ and d 3 < d*, define 



n* 



d. 



), A* 



d 2 d 3 



d* /v ^ d*" ~ df 
Then we choose the following value of e 



exp[-2(ft + ft)]. (43) 



00 



n* > A*; 
otherwise. 



(44) 



After some algebraic calculation, we have for the case II* > A*, 

1 



1 " i-(Vn*- Va*) 2 ' 

and subsequently using l !32b . (|33), ( 140b and J45t 

exp[-2(ft + ft + ft + ft)] 



d 4 > 



i - (vn* - va*) 2 



(45) 



(46) 



For the case n* < A*, we have t > 1, which gives the trivial bound 
of 



d 4 > ff* cxp[-2(ft + ft + ft + ft)]. 



(47) 



This is not yet the bound given in Theorem [2] since the definitions of 
n and A are not the same as those of n* and A*, which are only 
defined when d 2 < d\ and d 3 < d*. To close this gap, note that if 
di > d* (or d 3 > d*), we may trivially write the lower bound ( 147 i , 
which coincides with \25\ for this case, due to the fact n = by 
the definition of d 2 (or ds). Thus the lower bound in Theorem [2] is 
established. 

Forward: Now we shall use the general achievable region given in 
Theorem 1' to derive an inner bound for the Gaussian source, and 
show it coincides with the outer bound. 
Construct the following random variables 

Ui = X + Ni,X' = X -E(X\Ui),U 2 =X' + N 2 , (48) 
U 3 = X' + N 3 , Ui = X' + N 4 , (49) 
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where N±, N 2 , N 3 , N4 are zero mean jointly Gaussian, independent 
of X, and having the covariance matrix 



(50) 



It can be seen that X' is essentially the innovation of X given Ui. 
The decoding functions are 

f 1 (U 1 )=E(X\U 1 ) = -5^-5 
















-2 2 


pa -20-3 







PCT2CT3 


















X2 



f 2 (Ui,U 2 )=X 1 +E(X'\U 2 ) 



X 3 = f 3 (Ui,U 3 )=X 1 +E(X'\U 3 ), 



We have that the following rate Ri is achievable 

fli> J(X;0i) = i]og^pi. 

Choose (Ji such that the above inequality holds with equality. Then 
we have 



di = E{X' 2 ) 
We have also 



„2 2 



o 2 exp(— 2i?i) = d*. 



d 2 = E[X - X 2 ] 2 = E[X' - E(X'\U 2 )] 2 = 
d 3 = E[X - X3] 2 = E[X' - E(X'\U 3 )] 2 = 



di + erf 
dial 



(56) 



(57) 



(58) 



Note that for optimal distortion quadruples, d2 < d* and d3 < d[ 
should be chosen, i.e., d 2 = d 2 and d3 = d3. 
Notice also 



I{X;U 2 ]U t ) 



I{X'-X' + N 2 \U- L ) ( = 5 I(X';X' + N 2 ) 



1, di + 



(59) 



where (a) is true because U\ is independent of Jf' and Choose 
o 2 such that 



i? 2 > - lOf 



di + a| 



(60) 



which can always be done because the function is continuous 
Similarly choose a\ such that 

di + al 



R 3 > - loa 



(61) 



From dl7l l. the sum rate satisfying the following bound is achiev- 
able 

R! + R 2 + Rs 
> I{X;Ui)+I(X;U 2 U 3 \U 1 ) + I(U 2 ;U 3 \Ui) 
= Ri + I(X'; U 2 U s \ Ui ) + I(X' + N 2 ; X' + N 3 1 Ui) 
= Ri+I{X';U 2 U 3 )+I{U 2 ;U 3 ) 
= R^hiN^ + hiU^ + hiUs), 

which gives 

1 



2 (l-p 2 )aiai 
When II > A, we may choose 



loa 



d 2 d 3 (l - 



(62) 



P = 



-4 1- 



d\ exp[-2(J?2 + R 3 )\ 



Then 

/(X;f/ 4 |f/iC7 2 C/ 3 ) 



I(X'; Ui\U 2 U 3 ) = h{Ui\U 2 U 3 ) - h{N 4 

1 . a\ + d\ 
-log — -5—, 





where 




Ui. 


dl = E(X' - E{X'\U 2 U Z )) 2 

d 1 alai(l-p 2 ) 




(51) 


dia 2 2 a 3 2 (l-p2)+d 1 (a 2 2 + f 7 3 2 ) 


— 2pd\o 2 o 3 


Choose 04 such that 




(52) 






(53) 


1 o\ + dl 




(54) 


We further have (and after some simplification) 

d 4 = E(X' -E(X'|t/ 2 t/3i74)) 2 




(55) 


exp(-2 J R 4 )dicrl C r|(l - 


P 2 ) 



d\o\o\(\ - p 2 ) + di(a| + a 2 ) - 2pdia 2 a 3 
a 2 exp[-2(7?i + R 2 + R 3 + Ri)] 



(64) 

(65) 

(66) 
(67) 

(68) 



Thus if IT > A, i.e., di + di exp[-2(i? 2 + R 3 )] > d 2 + d 3 , then 
this achievable region matches the outer bounds. 

When II < A, i.e., di + di cxp[-2(i? 2 + Ra)] < d 2 + d 3 , we can 
find some d' 2 < d 2 and d 3 < d3, where at least one of the inequalities 
is strict, such that di +di exp[— 2(i?2 +-R3)] = d! 2 +d 3 . To see this, 
let us first consider choosing d' m £ [di2~ 2Rm , d m ], m — 2, 3. By a 
continuity argument, d' 2 + d' 3 such chosen can take any real value in 
[di exp(— 2%) +di exp(— 2R 3 ), d 2 +d 3 ]. It is obvious that di[l + 
exp(-2(7? 2 + R 3 ))] e [di exp(-2i? 2 ) + di exp(-2J? 3 ), d 2 + d 3 ], 
and it follows there always exists at least a pair of (d' 2 , d 3 ) such that 
d' 2 + d 3 = di + di exp[-2(i? 2 + Ra)], i.e., II' = A'. Thus we can 
conclude the distortion quadruple (di, d' 2 , d' 3 , exp(— 2(i?i + R 2 + 
R 3 + Ri))) is achievable by l !68t . which implies that the quadruple 
(di, d2, d3, exp(— 2(Ri + R 2 +R 3 + R4))) is achievabl^B Therefore 
for both the case II > A and II < A the achievable region indeed 
matches the outer bounds. ■ 



B. Fixed channel configuration and the performance loss 

One case of interest is that the good channel and the bad channel 
used to transmit are fixed as Ri/R 2 = R 3 /Ra, and we will consider 
the performance loss in this case. Suppose R 2 = ctR\ and R4 = 
aR 3 . 

Consider the cases where only the first layer performance or only 
the second layer performance is in consideration. For the former case, 
i.e., the first layer, user 1 has description of rate Ri while user 2 
has joint description of rate (1 + a)R\, which results in minimum 
distortions o 2 exp(— 2i?i) and o 2 exp[— 2(1 + a)Ri]. For the latter 
case, i.e., optimized only for the second layer, user 1 has description 
of rate 7ii+J?3 while user 2 has joint description of rate (l+a)(Ri + 
R 3 ), which results in minimum distortions o 2 exp[— 2(Ri + R 3 )] and 
ol exp[-2(l+a)(i?i + J R 3 )]. 

Now for an MSR system to achieve the same minimum second 
layer distortions as if only the second layer is in consideration. Then 
by the result from the previous section, we see that II < A, which 
gives 



d 2 + 



a 2 exp[-2(i?i +R 3 )] > d![l + exp[-2(-R 2 + Ra)]], 



d 2 d 3 



This degenerate region was not treated by Ozarow in [6], and it sometimes 
causes certain confusion. 
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which further gives 

d 2 > d\[l + exp(-2(ai?i + R 3 )) - exp(-2R 3 )]. (69) 

For a single layer system optimized for the first layer, distortion d\ = 
a 2 exp(— 2Ri) and d 2 = a 2 exp[— 2(1 + a)Ri] are achievable, thus 
the loss on d 2 can be as large as 

<h_ [1 + exp(-2(qfli + R 3 )) - exp(-2i? 3 )] 

d 2 exp(— 2aRi) 

= exp(2ai?i) + exp(-27? 3 ) - exp[2(ai?i - R 3 )]. 

Thus we see that as R± — > oo, the performance loss compared to a 
single layer system can be unbounded. However, the distortion di 
is not jeopardized by the progressive encoding requirement. In other 
words (di, d 3 ,di) can be matched to an optimal coding system with 
coding rate (Ri , Ri + R 3 , (1 + a) (Ri + R 3 ) ), with the distortion d 2 
being quite large. If d 2 is of little importance, then such a system can 
be utilized; otherwise, the performance loss needed to improve d 2 is 
large, and can hardly be compensated by the added functionality. 



C. MD coding vs. Wyner-Ziv coding 

The coding approach proposed in [1] is based on the Wyner- 
Ziv (WZ) coding, which treats the reconstruction X\ and X2 as 
side informations at the decoder. In this section we compare the 
performance by the WZ-based coding approach with that by the MD- 
based coding approach. 

To compare the two coding schemes, fix di = d* = 
ol exp(-2J?i) and d 2 = d 2 = ol exp[-2(i?i + R 2 )]. For the WZ- 
based approach, since the (f> 3 and 04 are successive refinement by 
definition, the WZ-based coding is in fact the successive Wyner-Ziv 
problem with degraded side information at the decoder considered 
by Steinberg and Merhav in [28]. Though X\ and X2 are not 
necessarily physically degraded, as pointed out in [28], the achievable 
region is only dependent on the pairwise distribution between the 
source and the side information, thus statistical degradedness and 
physical degradedness have no essential difference. It is known that 
the Gaussian source and side informations can always be taken as 
statistically degraded, and thus the general result in [28] can be 
readily used. The rate-distortion region for the Gaussian source was 
given explicitly in [15], and can be (modified accordingly and) written 
as follows. Choose a\ and o\ such that 



d\ = 



„2l„2 1 „2\ 



„2 2 

Tl + al 



(70) 



and define 7 = - 
based coding are given by 



then the achievable distortions using WZ- 



d'z > ex P( — 2R 3 )d 3 = o\c exp[— 2(i?i + R 3 
d'i > exp[-2(i? 3 + Ri)] 

„2„2 2 



(71) 



{al + o-l + a|)((l - 7)2 min(d 3 , d\) + yaf) 
On the other hand, the MD-based coding approach can achieve 



(72) 



d 3 
(I -i 



> a x exp[ 



2 r 



-2(Ri ■ 
-2(Ri 



R3)], 

-R 2 + R 3 + R4)] 



1 



lVn- 



where we have for this special case 



d 



d: 



n = (1 - £)(1 - ^) = [1 - exp(-2 J R 2 )](l - 



dl 



dl 



(73) 
(74) 

(75) 



5.5 
5 

4.5 
4 

o 

3.5 
3 
2.5 



x 10 ' 



- MD-based coding 

- WZ-based coding 



0.1 0.12 0.14 



Fig. 4. Comparison of the distortion regions (d 3 ,d^) of the two coding 
approaches at Ri = R 3 = 1.0 nat and R2 = -R4 = 0.5 nat with fixed 
di = d"[ and d 2 = ^2 ■ The range of d 3 is [cr^ exp[— 2(R\ + R 3 )], d^]. 



as well as 



A = 



~ - exp[-2(i? 2 + R 3 )] 



df 



,d 3 



= exp(-2 J R 2 )[-i - exp(-27? 3 )]. 
"1 



(76) 



A set of typical tradeoff curves of (d3,d4) for the WZ-based 
approach and MD-based approach are given in Fig. [4] for fixed 
rates (Ri, R2, R 3 , R4), and we can see the gap is non-zero. As 
such, the WZ-based approach is suboptimal except two extreme 
operating points: one point is when d 3 = dl and the other one 
is d 3 — exp[— 2(Ri + -R3)], which correspond to description 
(j> 3 is either completely useless for decoder 3, or maximally useful 
for decoder 3, respectively. This in fact illustrates the role of side 
informations at only the decoders or at both the encoder and the 
decoders are quite different: it is known that for the Gaussian source 
there is no loss between the cases when a single Gaussian side 
information is available at both the encoder and the decoder, or at the 
decoder only, however when there are multiple side informations at 
different decoders, they are no longer equivalent. This observation 
perhaps was firstly made explicit in [9]. Though the Wyner-Ziv 
coding based approach is sub-optimal in this problem, it does have 
certain advantage, particularly when the second round of transmission 
is not encoded together with the first round descriptions, but is made 
possible when certain network resource becomes available after an 
initial transmission. 

The MSR problem in fact has a more subtle connection with the 
encoder/decoder side information (EDSI) problem considered in [9] 
(see also [29]), which is depicted in Fig. [5] The connection is through 
one particular special case for the EDSI problem, when the source 
and side informations are physically degraded as X <-> Y <-> Z, and 
in the Gaussian case we may write without loss of generality Y = 
X+Ni and Z — Y+N2 where JVi and are independent Gaussian 
noise. Now if we take Xi and X% in MSR as the side informations Z 
and Y, respectively, and let R4 = 0, then MSR can be considered as a 
relaxed version of the EDSI problem, because in MSR the codeword 
Xi and X2 do not have to be generated by any marginal distribution 
as specified in the EDSI problem, but we can indeed choose them to 
have such structure with single-letter distribution Pzy\ furthermore, 
in the MSR problem, decoder one always has X\ (corresponding to 
Z), rather than only A 2 (corresponding to the better side information 
Y). As such, if in the MSR system we set Ri = I(X;Z) and R2 = 
I(X; Y\Z), Di = E[X - E(X\Z)] 2 and D 2 = E[X - E(X|Y)] 2 , 
an outer bound for the EDSI problem can be found; it is an outer 
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Encoder 



Decoder 1 



■x, 



Decoder 2 



Fig. 5. The system diagram for the EDSI problem. 



not operate in the degenerate region. It is now seen that the MDCR 
approach with non-zero R4 is suboptimal because d,4 > d'4, due to 
the fact that A' > A and TJ = II. This is a stronger result than the 
known one that it is sufficient for R4 = to achieve optimality: it 
is in fact necessary for R4 to be zero in order to be optimal in 
the Gaussian MD case in general. 

This result suggests any system based on the MDCR approach 
when the refinement rate is not zero is not optimal for the Gaussian 
source: one such example is the system constructed with dithered 
lattice quantizers in [31]. 



bound since we can use Pxzy as Pxu-,_u 2 in the minimization for 
the MSR problem, but the chosen (Ri, R2, Di, D 2 ) also allows for 
other choices of random variables. As shown in [9], this outer bound 
is indeed achievable by using a hybrid conditioning/binning scheme 



V. A Variation of the MD Problem: the MDCR Problem 

As aforementioned, when Ri = 0, the problem being considered 
reduces to the MD problem with central refinement (MDCR), and 
the six-tuple of rates and distortions are to be characterized. Again 
we focus on the Gaussian case; we will continue to use the notations 
(R2, R3, R4) and descriptions 02, 03, 04 and assuming Ri = and 
no description 0i exists. 

El Gamal and Cover constructed an MD scheme for general sources 
based on the MDCR method in [7]. More precisely, the description <j>4 
of rate R4 is split and combined into the existing two descriptions 
02 and 03, and the resulting two descriptions are of rates R' 2 = 
R2 + PRa and R' 3 = R 3 + (1 - f3)R 4 for some < (3 < 1. It is 
known that in the Gaussian MD problem, there is no need for the 
central refinement coding to achieve the complete distortion region; 
i.e., R4 — is sufficient to achieve the complete distortion region 
given the two description rates R' 2 and R' 3 . A natural question to ask 
is whether it is possible to construct an optimal Gaussian MD system 
using an MDCR system with nonzero rate R4. 

The answer to the above question in fact negative, which is implied 
by Theorem[2] To see this, assume the distortion d 2 and dz in both an 
MDCR-based system and an optimal MD system, such that d 2 < d\ 
and dz < d*. By Theorem [2] we see that for an MDCR system with 
non-zero R4 



d 4 > 



aj exp[-2(fl 2 + Rz + Ra 

1 - (Vn- Va) 2 



(77) 



where 



A 



^i- eX p[-20R 2+J R3)], II = (1 (78) 



For an optimal MD system, the distortion resulting from the joint 
description can be 



d' 4 > 



<7*exp[-2(i£ + J& 



(Vn 7 - vA 7 ) 2 



(79) 



where 



A' = 



d 2 dz 



exp[-2( J R 2 + R' 3 )\, n' = (1 - ^|)(1 - (80) 



To keep the rates of the two system equal, we have R' 2 = R 2 + PR4 
and R'z = R3 + (1 — /3)i?4 for some < (3 < 1; further assume 
d 2 and dz are chosen such that II' > A', i.e., the MD system does 

6 The (same) outer bound for this case given in [9] was derived by applying 
the conditional version of the results of [30], which is indeed closely related 
to the MD problem. 



A. The high-rate asymptotics for balanced descriptions 

We consider balanced MDs in this subsection, and further assume 
0%. = 1. Suppose in an MD system, the rate of two descriptions are 
at equally high rate of R' each, and the side distortions are both 
d' 2 = d' 3 . It can be shown that if the side distortion is of the form 
d' 2 = fr2~ 2(1 ~ ,,)iJ ', where < 77 < 1 and 6 > 1, the central distortion 
of an MD system can asymptotically (at low distortion) achieve 



d'4 > 



-2R' 



/2{b + ^W 



-2«'(l + r,) 



/46 



1) ^ = 0; 

< T) < 1. 



(81) 



Notice the condition < 77 < 1 in fact corresponds to the condition 
that 1 3> d' 2 and d' 2 3> d'4 at high rate. In this case, the central and 
side distortions' product remains bounded by a constant at fixed rate, 



-, independent of the tradeoff between them. 



which is d'4d' 2 > ■ 
This product has been used as the information theoretical bound to 
measure the efficiency of quantization methods [32], [33]. Below, the 
performance of the optimal MD system is compared with that of an 
MDCR-based system in this high-rate and high-refinement-rate case. 

For an MDCR-based MD system, R4 is allocated to the refinement 
stage, and thus each of the first stage descriptions is of rate R' — R4/2. 
Keeping the side distortion of this system d 2 = d' 2 — fo2~ 2 ' 1-, '- )fl 
for an easier comparison, consider the case 1 > 77 > 0, and let 
R4 = 2771 R', where 1 — 771 is the ratio between R 2 and R' 2 . Then it 
can be shown (through some algebra) that using the MDCR approach, 
we can achieve 



d 4 > 



2- 2fl ' (1+ " ) /2(& + \/F — T) 



-2fl'(l+77) 



/4b 



771 = 77; 
< 771 < 77. 



(82) 



This implies that if the first stage has sufficient excess marginal rate, 
i.e., 771 < 77, then the performance loss from the optimal MD system 
by the MDCR approach with non-zero R4, in terms of the distortion 
product, is asymptotically zero in the range of 1 2> d 2 and d 2 d4. 
However, as the rate allocated to the refinement stage increases, the 
excess marginal rate in the first stage decreases. When 771 = 77, the 



performance loss is a factor of ■ 



If the first stage is without 



(i,+ V6 2 - 

excess marginal rate, which means 771 = 77 and 6 = 1, then the loss 
is a factor of 2 comparing to the MD system without taking such an 
MDCR approach. 

This discussion suggests that the MDCR approach is appealing 
for the high-rate case, if 1 3> d 2 and d 2 3> d4 is the desired 
operating range. However, the first stage should reserve sufficient 
excess marginal rate in order to avoid the performance loss. Taking 
the MD system in [31] as an example, using certain sub-optimal 
lattices for 02 and 03 is potentially able to achieve (asymptotic) 
optimal performance, but using two good lattices as 02 and 03 
will not be, because the excess marginal rate is diminishing as the 
dimension increases. 

VI. Conclusion 

We considered the problem of multiuser successive refinement. 
An achievable region is provided, which is shown to be tight for the 
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Gaussian source under MSE measure. It is shown that different from 
the single user case, the MSR coding necessitates performance loss, 
which can be unbounded. The results rely on the recognition that 
a multiple description system is embedded inside the MSR system. 
The MSR system also includes a variation of the MD system, namely 
the MDCR problem. This problem is treated with some depth, which 
reveals some interesting implications in designing the MD coding 
system. 

For the general problem with an arbitrary K > 2 rounds of 
transmission, or K > 2 users, an achievable region can be derived 
using the technique developed in [34] and [21]. However, even for 
the Gaussian case, the problem is highly intractable, and a complete 
characterization appears difficult. Given the results in the current 
work, we expect the loss of performance for the general case to be 
more severe than the K = 2 case. 
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